Instrument Identification in Optical Music Recognition
Authors
Abstract
We present a method for recognizing and interpreting the text labels that name the instruments in an orchestral score, thereby associating staves with instruments. This is one of many tasks required in optical music recognition. Our approach treats the score system as the basic unit of processing. A graph structure describes the possible orderings of instruments in the system. Each instrument may apply to several staves, may be represented by several possible text strings, and may appear at several possible positions relative to the staves. We find the globally optimal labeling of staves using dynamic programming, embedding simple template-based optical character recognition within the overall recognition scheme. When given an entire score, we jointly optimize the text labeling for each system and the character template models, thus adapting to the font at hand. Our implementation does this by alternating between identifying the text labels and re-estimating the character templates. Experiments on 10 different scores show a significant improvement due to adaptation.
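The abstract gives no implementation details, so the following is only a minimal sketch of how a dynamic program might label the staves of one system, not the authors' method. It assumes a hypothetical `successors` graph of allowed instrument orderings, a hypothetical `max_staves` limit on how many consecutive staves each instrument may span, and a hypothetical `cost(staff, instrument)` function standing in for the template-based text matching score (lower is better).

```python
from functools import lru_cache


def label_staves(n_staves, successors, start_nodes, max_staves, cost):
    """Return (total_cost, labels) for the cheapest valid labeling, or None.

    successors:  dict mapping an instrument to the instruments that may follow it
    start_nodes: instruments allowed on the first staff
    max_staves:  dict mapping an instrument to the most staves it may span
    cost:        function (staff_index, instrument) -> float, lower is better
    All of these inputs are illustrative assumptions, not from the paper.
    """

    @lru_cache(maxsize=None)
    def best(i, inst):
        # Cheapest labeling of staves i..n_staves-1 when `inst` starts at staff i.
        results = []
        span_cost = 0.0
        for k in range(1, max_staves[inst] + 1):
            if i + k > n_staves:
                break
            span_cost += cost(i + k - 1, inst)
            if i + k == n_staves:
                # The instrument covers the last staff: a complete labeling.
                results.append((span_cost, (inst,) * k))
            else:
                # Otherwise continue with any instrument allowed to follow.
                for nxt in successors.get(inst, ()):
                    sub = best(i + k, nxt)
                    if sub is not None:
                        results.append((span_cost + sub[0], (inst,) * k + sub[1]))
        return min(results) if results else None

    candidates = [best(0, s) for s in start_nodes]
    candidates = [c for c in candidates if c is not None]
    return min(candidates) if candidates else None


# Toy system with three staves; unlisted (staff, instrument) pairs match poorly.
table = {(0, "Flute"): 0.1, (1, "Oboe"): 0.9, (1, "Violin"): 0.2, (2, "Violin"): 0.3}
successors = {"Flute": ["Oboe", "Violin"], "Oboe": ["Violin"], "Violin": []}
max_staves = {"Flute": 1, "Oboe": 1, "Violin": 2}

total, labels = label_staves(
    3, successors, ["Flute"], max_staves, lambda s, inst: table.get((s, inst), 1.0)
)
print(labels)  # ('Flute', 'Violin', 'Violin')
```

In this toy case the Oboe label is skipped because assigning Violin to two consecutive staves has a lower total cost. In an adaptive scheme of the kind the abstract describes, the costs would be recomputed after each re-estimation of the character templates and the labeling repeated.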
Similar resources
Music Instrument Identification Using MFCC: Erhu as an Example
In the analysis of musical acoustics, we usually use the power spectrum to describe the difference between timbres from two music instruments. However, according to our experiments, the power spectrum cannot be used as effective features for erhu instrument identification. In this paper, we use MFCC (mel-scale frequency cepstral coefficients) as features for music instrument identification usin...
Optical Music Recognition CS 194-26 Final Project Report
Optical Music Recognition (OMR), or alternatively sometimes referred to as Music Optical Character Recognition, is a system for music score recognition. Given a music sheet, usually in the form of an image, the goal of an OMR system is to use various vision algorithms to interpret the corresponding music symbols into digital form, typically playable in the form of a MIDI file. For this project,...
Analysis of Audio Descriptor Contribution in Singer Identification Process
An audio descriptor describes the information of an audio signal in a compact and precise representation. There are various standards available to extract the audio information in various ways to be used for particular applications such as, speaker recognition, musical instrument identification, singer identification, multimedia database indexing, genre detection and so on. There are various au...
Automatic Instrument Recognition in a Polyphonic Mixture Using Sparse Representations
In this paper, a method to address the automatic instrument recognition in polyphonic music is introduced. It is based on the decomposition of the music signal with instrument-specific harmonic atoms, yielding to an approximate object representation of the signal. A post-processing is then applied to exhibit ensemble saliences that give clues about the number of instrument playing and the instr...
Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music
This paper presents a new technique for recognizing musical instruments in polyphonic music. Since conventional musical instrument recognition in polyphonic music is performed notewise, i.e., for each note, accurate estimation of the onset time and fundamental frequency (F0) of each note is required. However, these estimations are generally not easy in polyphonic music, and thus estimation erro...
Journal title:
Volume, issue
Pages -
Publication date 2015